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Introduction 



This compendium contains exercises about regular languages for the course Formal Languages, Au- 
tomata and Theory of Computation at the School of Innovation, Design and Technology, Malardalen 
University. The notation used throughout this compendium is taken from the Swedish course book 
1 , and is described on the next page. All exercises has a solution in the end of the compendium, 
including those marked with k*’; Those marked with "S’; are test assignments. 

The compendium is outlined as follows. There are five sections; Mathematical preliminaries. 
Regular expressions. Finite Automata, Regular grammar and non-regularity. Each section starts with 
some simple exercises intended to learn new concepts. The difficulty of the exercises then increases, 
and in the end there are test assignments. 



Study Technique 

• Discuss your solutions in the group! The other students are your biggest support in this course, 
use that resource! 

• Try to solve the easy exercises in the beginning of each section! 

• Don’t look at the solution until you have tried to solve an exercise. If you cannot solve it, look 
at the solution and try to grasp the basic idea of the solution and try to solve it again. You won’t 
learn anything from just copying a solution! 

• To ensure that a property holds for a certain automation, grammar, expression etc. it might help 
to elaborate with different strings. Try to construct strings which are violating what you are 
trying to prove! 

Good luck with the course! 



'Lennart Sailing, Formella sprak, 



1 berakningar, 2001 
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Notation 



Notation 


Meaning 


Example 


Other 

notation 


a 


The character ’a’ 


a,b,c 


a,b,c 


foo 


The string "foo" 


bar, thing 


bar, foo 


w,x,y,z 


Arbitrary string 


w € E* 




£ 


The empty string 




A 


k 


Numerical constant 


k,- 3,1,42 




n,m 


Numerical variable 


n = 2m 




L, S, A, B 


Language, set 


L = E* 




0 


The empty set 




{} 


E 


An alphabet 


E = {a,b} 




{w I foo} 


The set of all w such that /bo holds 


{w\ vT = w} 




a fc 


k repetitions of the character a 


a 4 = aaaa 


a k ,a A 


w k 


k repetitions of the string w 


w 3 = WWW 




A k 


The set of all strings with k arbitrary el- 


{1, 2} 2 = {11,12,21,22} 




A* 


ements from the set A 
Kleene star operation on A 


{0, 1}* »{e, 0, 1, 00, . . 




A+ 


Plus operator on the set A 


a* = (si a, aa, . . 

(foo)* = {e, foo, foof oo, . . .} 
{0, 1}+ = {0,1,00,01,10,...} 




w m 


Reverse of the string w 


(foobar) rev = raboof 


w R 


L 


The complement language to L 




-( l),l c 


2 a 


The powerset of A 


2<°’ 1 > = {0, {0}, {1}, {0, 1}} 




\A\ 


Number of elements in A (sometimes 


| { 1, 2, 3}| = 3, |automation| = 7 




M» 


denoted cardinality) 

The number of a’s in the string w 


abracadabra a = 5 


n a {w),#a{w) 


prefix (w) 


The set of strings x such that w = xz 


prefix (abc) = {e, a, ab, abc} 




pprefix(w ) 


The set of proper prefixes to w 


prefixfabc) = {a, ab} 




prefix (L) 


The set of strings w such that w is a pre- 


prefix (E*) = E* 




suffix (w) 


fix in some string in L 

The set of strings x such that w = zx 


suffix(abc) = {e, c,bc, abc} 




psuffix{w) 


The set of proper suffixes to w 


psuffix(abc) = {c, be} 




suffix(L) 


The set of strings w such that w is a suf- 


suffix (E*) = E* 




AUB 


fix in some string in L 
The union of A and B 


{1, 2, 3} U {2, 3, 4} = {1,2, 3, 4} 


A + B 


AnB 


The intersection of A and B 


{1, 2, 3} fl {2, 3, 4} = {2,3} 




xy 


The concatenation of the strings x and 


x = foo, y = bar, xy = f oobar 




A-B 


y 

The set of all elements which are in A 


{1,2,3} - {2,3,4} {1} 


A\B 


AxB 


but not in B 

The set of all combinations of an ele- 


{a, b} x {c, d} = {ac, ad, be, bd} 




# 

N 


ment in A concatenated with an element 
in B. 

The end of a string/stack 
The set of natural numbers 


N = {0,1,2,...} 




Z 


The set of integers 


Z = {. . . , —2, — 1, 0, 1, 2, . . .} 




R 


The set of real numbers 


47,011 € R 




□ 


Ergo, hence, therefore 
End of proof 




■ 
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1 Mathematical Preliminaries 



Exercise 1.1. (oi) 

Construct the powerset for the following sets. 

(i) {a,b} 

(ii) {0,1} U {1,2} 

(iii) {z} 

(iv) {0, 1,2, 3,4} fl {1,3, 5, a} 

(v) {0, 1,2,3} — {1,3,5, o> 

(vi) 0 (the empty set) 

Exercise 1.2. (02) 

Determine the cardinality of the following languages over the alphabet E = {0,1} (That is, are they 
finite, infinite and countable, or infinite and uncountable). Prove your answers. 

(i) E° 

(ii) E 4 

(iii) E* = E° U E 1 U E 2 U E 3 U • • • 

(iv) 2 s 

(v) 2 s *, that is the powerset of all languages over E*. (Hint: use a diagonalisation argument!) 

Exercise 1.3. (03) 

Find a possible alphabet E for the following languages. A word f oobar should be interpreted as a 
string of characters f , o, o, b , a and r. 

(i) The language L = {oh, ouch, ugh} 

(ii) The language L = {apple, pear, 4711} 

(iii) The language of all binary strings 

Exercise 1.4. (04) 

Describe what the Kleene star operation * over the following alphabets produces. 

(i) E = {0, 1} 

(ii) E = {a} 

(iii) E = 0 (the empty alphabet) 

Exercise 1.5. (05 ) 

Describe what the + -operation over the following alphabets produces. 

(i) E = {0, 1} 

(ii) E = {a} 

(iii) E = 0 (the empty alphabet) 
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Exercise 1.6. 

State the alphabet £ for the following languages : 



(06 ) 



(i) L = £* = {e, 0, 1, 00, 01, 10, 11, 000, 001, 010, Oil, . . .} 

(ii) L = £+ = {a, aa, aaa, . . .} 

(Si) L = £+ = {£} 

Exercise 1.7. (o7) 

Assuming that £ = {0,1}, construct complement languages for the following. 

(i) {010,101,11} 

(ii) £* - {110} 

(iii) £+ - £ 

Exercise 1.8. (08) 

State the following languages explicitly. 

(i) 2^ a ’ b > - 2^* 

(ii) {a, b} x {1,2,3} x 0 

(iii) {x | x £ N A 3y £ N : y < 10 A (y + 2 = x)} (N is the set of all non-negative integers) 

Exercise 1.9. (o9) 

Let £ be an alphabet, and let e be the empty string over £. 

(i) is £ in £? 

(ii) does it hold that ese = el does it hold for e l where i> 21 

(iii) Let x and y be two strings over £. Is the concatenation of x and y always the same as the 
concatenation of y and xl 

Exercise 1.10. (oio) 

A string is infinite when its length is infinite. Let £ be an arbitrary alphabet. 

(i) Does £* contain any infinite string? 

(ii) If £ would be an infinite alphabet (which it actually may not be), would the answer still be the 
same? 

Exercisel.il. (074) 

Prove that 6 n = 0 (mod 9) for all integers n > 2. 

Exercise 1.12. (075) 

Prove by induction that ,S' 2 = 2 1 s I (that is, that the size of the powerset is equal to 2 to the power of 
the size of the original set) for all finite sets S. 
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Exercise 1.13. 



(o76) 



Let x be a string and let x rev be “ the same” string but backwards. Prove that ( xy) rev = y rev x rev for 
arbitrary strings x, y over an alphabet S. (Hint: try to define x rev inductively) 

Exercise 1.14. (012) 

Prove by induction that \Ax B\ = bl ■ \B\ for all sets A and B. 

Exercise 1.15. (013) 

Prove that 2° + 2 1 + 2 2 H h 2” = 2 n+1 - 1 for all integers n > 0. 
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Solutions 



Solution 1.1. (oi) 

(i) 2 s = {0, {a}, {6}, {a, 6}} 

(ii) 5 = {0, 1, 2}, 2 s = {0, {0}, {1}, {2}, {0, 1}, {0, 2}, {1, 2}, {0, 1, 2}} 

(Mi) 2 s = {0, {z}} 

(iv) S = {1,3}, 2 s = {0, {1}, {3}, {1,3}} 

(v) S = (0, 2}, 2 s = {0, {0}, {2}, {0, 2}} 

(vi) 2 s = {0} 

Solution 1.2. ( 02 ) 

(i) £° = {e} The set is finite. 

(ii) 

£ 4 = { 0000 , 0001 , 0010 , 0011 , 0100 , 0101 , 0110 , 0111 , 

1000 , 1001 , 1010 , 1011 , 1100 , 1101 , 1110, 1111 } 

The set is finite. 

(iii) Each set in the infinite and countable series is finite and may therefore be counted by mapping it 
to the natural numbers. Thereby, the full set can be counted. .'. The set is infinite and countable. 

(iv) 2 s = { {a}, {0}, {1}, (0, 1} } .'. The set is finite. 

(v) .'. The set is infinite and uncountable. 

Solution 1.3. (o3 ) 

(i) £ = {o, h, u, c, g} 

(ii) £ = (a, p, 1, e, r, a, 4, 7, 1} 

(iii) £ = (0, 1} 

Solution 1.4. (o4 ) 

(i) All binary strings. 

(ii) All strings which contains nothing but a’s (including the empty string). 

(iii) The language which contains only the empty string. 

Solution 1.5. (o5 ) 

(i) Binary strings containing at least one character. 

(ii) Strings containing only a’s, with at least one character. 

(iii) The language which contains only the empty string. 




Solution 1.6. 



(06 ) 



(i) E = {0, 1} 

(ii) E = {a} 

(iii) E = 0 

Solution 1.7. (o7) 

(i) L = {e, 0,1, 00, 01, 10, 000, 001, Oil, 100, 110, 111, 0000, . . .} 

(ii) L = {110} 

(iii) L = {e} 

Solution 1.8. ( 08 ) 

(i) {{b},{a,b}} 

(ii) 0 

(iii) {2,3,4,5,6,7,8,9,10,11} 

Solution 1.9. (o9) 

(i) No. 

(ii) Yes. Yes. 

(iii) No, consider for instance E = {a, b}, x = a, y = b, xy = ab, yx = ba. 

Solution 1.10. (oio ) 

(i) No, even though the language is infinite, all strings have finite length. 

(ii) No, each symbol in the alphabet still has finite length, and each string has finite length. 

Solutionl.il. ( 074 ) 

Claim: 6” = 0 (mod 9) for all integers n> 2. 

Verification: 



6 2 = 36 = 9 • 4 , 6 3 = 216 = 9 • 24 , 6 4 = 1296 = 9-144 

heltal heltal heltal 

Base step: We choose 6 2 , that is n = 2 as base case, since this is the least n for which the claim 
must hold. Since 6 2 = 36 = 9 • 4, the claim holds for the base case. 

Induction step: We assume that the claim is true for a certain case and try to show that it is true for 
the following case as well. 

Thus, assume that 9 divides 6 P , that is 6 P = 9 k for some integer k. We want to show that this 
implies that 6 P+1 = 9£ for some integer t. 
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If 6 v = 9k for some p and some integer k, then the following holds. 



6 P+1 = 6 • 6 P 
= 6-9 k 
= 9 (6k) 

integer 

= 9t where £ is an integer 

Summary: The claim holds for n = 2, and if it holds for some case p, then it also holds for the case 
p + 1. Thus, it holds for all cases, which concludes the proof. □ 

Solution 1.12. ( 075 ) 

Claim: |5 2 1 = 2l ,s l for all finite sets S. 

Verification: 



| 0 2 | = 1 = 2 ° = 2 |01 

|{a;} 2 | = 2 = 2 1 = 2 I ^> 1 

|{x, y} 2 \ = 4 = 2 2 = 2^>l 



Base step: We choose 5 = 0 as base step, since this is the least set for which the claim should hold. 
According to the verification above, the claim should hold for the base step. 



Induction step: We assume that the claim is true for a certain case, and try to prove it for the next 
case. 

Thus, assume that |5 2 | = 2 I s for some set S. We want to show that this implies that (5 LI 
{a;}) 2 | = 2l' s-LJ -MI for an arbitrary element x(£S. 

If |5 2 | = 2 I s I for some set 5, then the following holds. 



(S U {x}) 2 
|(5U{*}) 2 | 



S 2 U T where T = {t U {a;} | t e S 2 } 

\S 2 UT\ 

|5 2 | + \T\ since S 2 and T are disjoint (x & S — > x & 2 s ) 
|5 2 | + |{tu{x}|te5 2 }| 

|5 2 | + |5 2 | 

2 • | 5 2 | 

2 • 2 I s I (from the inductive hypothesis) 

2 |si+l 

2 1 suM I s i nce x g S 



Summary: The claim holds for 5 = 0, and if the claim holds for some set S then it also holds for 
the set S U {a:} where x ^ S. Thus, it holds in all cases and the proof is concluded. □ 

Solution 1.13. (o76 ) 

s rev = e for the empty string e. a rev = a for a string with containing only a single character. A non- 
empty string x can be partitioned into two substrings x = ay where a contains one or zero characters. 
We can then define ( ay) rev = y m a where |a| < 1. 

Now we will show that (xy) rev = y rev x m . First assume that x is the empty string. If so, the claim 
is true, since ( sy) rev = y rev = y rev e. Now assume that a; is a string a containing only one character. If 
so, the claim is true as well, since ( ay) rev = y rev a by definition. These are our base steps. 
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Now we assume that the claim (uv) rev = v rev u rev holds for two strings u,v. This is our inductive 
hypothesis. Now we want to show that (auv) rev = v rev u rev a. 

(■ auv) rev = (uv) rev a by definition. Furthermore, (uv) rev = v rev u rev by inductive hypothesis, which 
yields ( uv) rev a = v rev u rev a, which is what we wanted to show. 

Since we have shown that the claim holds for a (two) base cases, and that it holds for any string 
which may be constructed by concatenating a character to the beginning of the first string, the claim 
holds for all strings. Et 

Solution 1.14. (012 ) 

\Ax B\ = | A\ ■ \B\ for the empty string A = 0 and an arbitrary set B. The claim also holds for A = 0 
and an arbitrary B. Now assume that the claim holds for two sets A and B (the induction hypothesis). 
We want to show that \A U {x} x B | = \A U {a:}| ■ \B\ for an arbitrary element x £ A (if x e A the 
claim holds trivially, since a; e A — + A U {x} = A). 

We start with the left hand side. The cross product implies that A U {x} x B = Ax B U {x} x B. 
Since Ax B and \x} x B are disjoint (since x £ A), it holds that |.4xBU \x} x B\ = \A x B\ + 
\{x}xB\ = \AxB\ + \B\. 

For the right hand side it holds that \A U {a;}| = \A\ + 1. Thus, we have | A U {a;} ■ \B\ = 
(|A| + 1)-|B| = \A\-\B\+\B\ = \AxB\+\B. But then it also holds that |Au{a;}x.B = |Au{®}|-|B|, 
which concludes the proof. □ 

Solution 1.15. ( 013 ) 

Claim: 2° + 2 1 + 2 2 H 1- 2" = 2 n+1 — 1 for all integers n > 0. 

Verification: 



2 ° = 1 = 2 1 — 1 = 1 
2° + 2 1 = 3 = 2 2 — 1 = 3 
2° + 2 1 + 2 2 = 7 = 2 3 - 1 = 7 

Base step: We choose n = 0 as base case, since this is the least number for which the claim should 
hold. According to the verification above, the claim holds for the base step. 

Induction step: We assume that the claim holds for a certain case, and try to prove that it holds also 
for the following case. 

Thus, assume that 2° + 2 1 + 2 2 + \-2 k = 2 k+[ — 1 for some number k. We want to show 

that this implies that 2° + 2 1 + 2 2 H h 2 fc + 2 k+1 = 2 k+2 - 1. 

If 2° + 2 1 + 2 2 + • • • + 2 k = 2 k+1 - 1 for some number k, then the following holds. 

2° + 2 1 + 2 2 + • • • + 2 k + 2 k+1 = (2 k+1 - 1) + 2 k+1 (from the inductive hypothesis) 

= 2 k+1 + 2 k+1 - 1 
= 2 • 2 k+1 - 1 
_ 2 fc + 2 — \ 

Summary: The claim holds for the base case k = 0, and if it holds for a number k then it also holds 
for the following number k + 1. Thus it holds for all cases, which concludes the proof. □ 
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2 Regular expressions 



Exercise 2.1. (023) 

Construct regular expressions representing languages, over the alphabet {a, b, c}, in which for every 
string w it holds that: 

(i) The number of a’s in w is even. 

(ii) There are 4z + 1 b’s in w. ( i > 0) 

(iii) \w\ = 3i (i > 0) 

Exercise 2.2. (024) 

A regular expression is ambiguous when there exists a string which can be constructed in two different 
ways from the regular expression. Which of the following regular expressions are ambiguous? 

(i) a((ab)*cd)* U a(ababcb*)*a* 

(ii) aab*(ab)* U ab* U a*bba* 

(iii) aaba* U aaaba U aabba* U a 

Exercise 2.3. (040) 

Consider the regular expression (a(cd)'*b) . 

(i) Find a string over {a, b, c, d) 4 which matches the expression. 

(ii) Find a string over {a, b, c, d} 4 which does not match the expression. 

Exercise 2.4. (041) 

Find a regular expression for the language consisting of alternating zeroes and ones. 

Exercise 2.5. (042) 

Find a regular expression for the language L over E = {a,b} consisting of strings which contain 
exactly two or exactly three 6’s. 

Exercise 2.6. (043) 

Find a regular expression over the language L over the alphabet E = {a, b} consisting of strings 
where the number of 6’s can be evenly divided by 3. 

Exercise 2.7. (044) 

Describe which languages the following regular expressions represents, using common english. 

(i) (0U1)*01 

(ii) 1*01* 

(iii) (11)* 

(iv) (0*10*10*)* 

(v) (0U1)*01(0U1)* 
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(vi) 1*0* 

(vii) (10U0)*(1U10)* 

(viii) 0*(1U000*)*0* 

Exercise 2 . 8 . (022) 

Let E be the following regular expression: 

c*(e U a(aU b Uc)*U(aUbU c)*b)c* 

(i) is L(E) = {a, b, c}*? 

(ii) is L(EE) = {a, b, c}*? 

ns 3 Test Assignment 1. (Lesson ) ( 045 ) 

Find a regular expression which represents the set of strings over (a, b} which contains the two 
substrings aa and bb. 
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Solutions 



Solution 2.1. (023 ) 

(i) (bUc)*((a(bUc)fj 

(ii) (aUc)*b(aUc)*^(b(aUc)*) 4 j 

(Hi) (S 3 )* 

Solution 2.2. (024) 

(i) Ambiguous. For instance, the string a can be constructed by using a ( (ab) * cd) or a(ababcb* ) * a* . 

(ii) Ambiguous. The string abb can be constructed either by ab * or a*bba* . 

(iii) Unambiguous. 

Solution 2.3. (040) 

(i) abab matches. 

(ii) abed doesn’t match, since every c must be preceded by an a in order for the string to match. 

Solution 2.4. (041) 

Alternative 1: 

1. (01)* is the language of zero or more 01. 

2. (1 U a) (01) * is the language of alternating zeroes and ones which ends in 1 . 

3. (1 U a) (01) * (0 + a) is the language of alternating zeroes and ones. 

Alternative 2: (0 U a)(10)*(l + a) according to the above. 

Solution 2.5. (042) 

1. a*ba* is the set of strings with exactly one b. 

2. a*ba*ba* is the set of strings with exactly two b’s. 

3. a*ba*ba*ba* is the set of strings with exactly three 6’s. 

4. The language L is then a*ba*ba* U a*ba*ba*ba* . 

Solution 2.6. (043) 

1. a*ba*ba*ba* are all strings with exactly three 6’s. 

2. ( a*ba*ba*ba*)*a * is then the language L. 
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Solution 2.7. 



(o44 ) 



(i) An arbitrary number of binary characters (0 or 1) precedes the substring 01. 

(ii) Strings that must contain a zero but which otherwise consists only of ones. 

(iii) Strings consisting only of ones and which lengths are even. 

(iv) An arbitrary number of repetitions of a string consisting two l’s and an arbitrary number of 
zeroes in arbitrary positions. 

(v) Strings containing the substring 01. 

(vi) Strings on the form 111 . . . 000 . . ., that is, strings that begins with zero or more ones followed 
by zero or more zeroes. 

(vii) • (10 U 0)* is all strings which doesn’t contain the substring 11. 

• (1 U 10)* is all strings which doesn’t contain the substring 00. 

so the concatenation of these is all strings where each occurrence of 00 precedes all occurrences 
of 11. 

(viii) A1 strings which doesn’t contain the substring 101. 

Solution 2.8. ( 022 ) 

(i) No. See for instance the string ba which cannot be produced by the regular expression. 

(ii) Yes. 

ca? S olution test assignment 1. ( 045 ) 

(a U b)* ((aa(a U b)*bb) U ( bb(a U 6)*aa)) (a U b)* 
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3 Finite Automata 



Exercise 3.1. (oi6) 

Let M be the following DFA. 




(i) Write down four string accepted by M and the sequence of configurations that shows this. 

(ii) Write down four strings not accepted by M. 

Exercise 3.2. (ois) 

Let M be the following DFA. 




(i) Write down four strings accepted by M and the sequence of configurations that shows this. 

(ii) Write down four strings not accepted by M. 

Exercise 3.3. (ois) 

Construct a DFA which accepts the following language: 

L = {«; | w € £* A w contains the substring 0101} 

That is, w = xOlOly for two arbitrary strings x and y. 

Exercise 3.4. (o36) 

Construct finite automata, deterministic of non-deterministic for the following regular expressions. 



(i) a(a U b)*b 

(ii) 1(1U0)*0U0 

(iii) 0*10* 
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(iv) (0U1)*1(0U1)* 

(v) 0*(1 U e)0* 

(vi) (0U1)(0U1)(0U1)* 

(via) a(a U 6)*a U b(a U b)*bU aU b 

Exercise 3.5. (037) 

Formally define the automata you constructed in Exercise 3.4. 

Exercise 3.6. (017) 

Which languages are accepted by the following automata: 
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(Vi) start 

Exercise 3.7. (020) 

Find DFA’s which accepts the following languages: 

(i) Strings over {a, 6} ending in aa. 

(ii) String over {a, 6} containing three consecutive a’s (that is, contains the substring aaa ). 

(iii) Strings over {a, b} not containing the substring aaa. 

(iv) All strings over {a, b} where each string of length 5 contains at least two a’s. 

(v) The set of strings over {a, b} where each pair of a’s immediately succeeds a pair of b’s. 

(vi) The strings e, 001 and 000101. 

(vii) Strings over {0,1} beginning with a 1 and which, if interpreted as a binary number, is a multiple 
of 5. As an example, the string 1010 should be accepted since 1010 2 = IO10, which is a 
multiple of 5. On the other hand, the string 1110 should not be accepted, since 1110 2 = 14io 
which is not a multiple of 5. 

(viii) {w € {a, b}* | |io| a = 2 n, \w\b = 2m, m, n € N} 

(ix) {«; e {a, b}* \ w \ „ = 3 n, n £ N} and w doesn’t contain the substring aba. 

Exercise 3.8. (025) 

Construct finite automata which accepts the languages defined by the following regular expressions. 

(i) a*ba*ab* 

(ii) b((aab* U a 4 )b)*a 

(iii) ab^((ba)* U bbb)* U a^ b 

Exercise 3.9. (029) 

Construct a DFA which is equivalent to the following regular expression: 
(00U(1U01)(11U0)*10)* 

Exercise 3.10. (019) 

Prove that the following automation accepts the language L = {a l V . aV | i,j > 1}. 
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Exercise 3.11. 



Convert the following finite automata to GFA’s and find out which regular expressions which de- 
scribes the automata’s languages. 



b 




Exercise 3.12. (027 



Construct regular expressions which are equivalent to the following (deterministic) finite automata: 






Exercise 3.13. (030) 

Construct a regular expression representing the following automation. 



Exercise 3.14. (o 38 ) 

Convert each of the NFA’s below to an equivalent DFA by subset construction. 
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Exercise 3.15. 

Convert the following NFA to a DFA. 




Exercise 3.16. 

Minimise the DFA below by set partitioning or double complement. 




Exercise 3.17. 

Minimise the following DFA and prove that your new DFA really is 




Exercise 3.18. 



(o64 ) 



Let Li and L 2 be languages represented by the following automata. Construct a minimum DFA 
representing L\ U L 2 . 




b® Test Assignment 2. (Lesson ) ( 028 ) 

Construct regular expressions for the automation defined in exercise 3.1. 

nap Test Assignment 3. (Lesson ) (on) 

Convert the following NFA to an equivalent DFA by subset construction. 




b® Test Assignment 4. (Lesson ) ( 072 ) 

Construct the minimum DFA which is equivalent with the following regular expression: 

((a U b)* U )(ac)*b U a 

Hint: The DFA has 8 states. 
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Solutions 



Solution 3.1. (oi6 ) 

(i) 1. a:(l,a#},<2,#} 

2. bba: (l,bba#), (4,ba#), (4, a#}, (3, #} 

3. baa: <l,baa#}, (4, aa#}, (3, a#}, (2, #} 

4. baba: (l,baba#}, (4, aba#}, (3,ba#}, (1, a#}, (2, #} 

(ii) 1. £ 

2. bbbb 

3. aaaa 

4. abab 

Solution 3.2. (ois) 

(i) 1. aa:(l,aa#),(2,a#),(3,#} 

2. bb: (1, bb#}, (5, b#), (4, #} 

3. aaab: (1, aaab#), (2, aab#}, (3, #ab), (5, #b), (4, #) 

4. bbba: (l,bbba#), (5, bba#}, (4, #ba), (2, #a), (3, #) 

(ii) 1. £ 

2. a 

3. aaab 

4. bbb 

losningo!5 



0,1 




Note that if the input string w exists in the language L, we eventually will find the substring 0101 
in w. We define the states corresponding to how much of the substring 0101 has been observed: 

• State 0 is the initial state. In this state no part of the substring 0101 has been observed. 

• In state 1, the first zero of 0101 has been observed. If a zero turns up, we continue to search for 
the next one in the substring 0101. 

• In state 2, the substring 01 of 0101 has been observed. If the next character is a one, we return 
to the initial state since Oil is not a substring of 0101. 

• In state 3, the string 010 has been observed. If we add another zero, we return to state 1 - this 
because the new zero may be the start of a new substring 0101. 

• In state 4 we have observed the full substring 0101 . Thus, it doesn’t matter what other characters 
there are in the string, since we already have concluded that the string exists in the language. 
Consequently, state 4 is an accepting state. 
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(ii) M = ({1, 2,3}, {0, 1}, {1}, {3}), 



a 0 l s 

1 3 2 

2 2,3 2 

3 

a 0 1 

(iii) M = ({1,2}, {0,1}, <5, 1,{1, 2}), 1 1 2 

2 2 

cr Ole 

(iv) Af= ({1,2}, {0,1}, <5, {1}, {1,2}), 1 1 1,2 

2 2 2 

cr 0 1 £ 

(v) Af= ({1,2}, {0,1}, (5, {1}, {1,2}), 112 2 

2 2 

a a be 

(vi) M = ({1, 2, 3}, {a, b}, S, {1}, {3}), \ ^ 2,3 

3 

cr a be 

1 2,3 3,4 

(vii) Af=({l,2,3,4},{a,&},<f,{l},{3}), 2 2,3 2 

3 

4 4 3,4 

Solution 3.6. (017) 

(i) The strings with an even number of characters and length of at least 2. 

(ii) The strings which contains at least one a, followed by an odd number of characters. 

(iii) The strings that ends in 101. NOTE: or? 

(iv) The strings on the form 0"l m , n, m > 0. 

(v) The strings that ends in a. 

(vi) The strings that doesn’t start with a b. 

Solution 3.7. (020) 



b 
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Convert the NFA to a DFA (join the obviously equivalent states (5, 2} and (4, 6}). 




Solution 3.10. (019) 

We construct a GFA from the finite automation. 

1 . Eliminate states 2 and 6, new transitions (1 , 3) , (1 , 4) , (1 , 5) , (1 , 4) . 




2. Eliminate state 4. Since there are no paths from state 4 (dead state) we can remove it entirely, 
b 




3. Create a new accepting state, number 7, and make new epsilon-transitions from the previous 
accepting states to 7. We do this to be able to eliminate state 3 and 5. 
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5. Join the two transitions from 1 to 7. 




Thus, the automation accepts the language a + 6 + Ua + c + , which is the same as {aW ,aV | i,j > 1}, 
which concludes the proof. □ 

Solution 3.11. (035 ) 

(i) b* a 

(ii) b*a(a U 6)* 

(iii) (a U baa)*b(a U e) 

Solution 3.12. (027) 

Construct GFA’s to create regular expressions, see Sailing page 48 and forward. 

(i) (b + a(b U ab*a )) * ( ab * U b + a(e U ab *)) 

(ii) [ab U aa(aa)*ab) (a U aa(aa)*(e U a)) 

(iii) a(aaUa&)* 

(iv) b*a(b + ab*a)* 



Solution 3.13. (030) 

We start by constructing the corresponding GFA. 

0U11 0U10 

1. New final state, remove state 2. 




0U11U10(0U10)*11 




We now obtain the regular expression (0 U 11 U 10(0 U 10)*ll)*10(0 U 10)*. 
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1. Initial state: {gO, ql} 

2 . {qO,ql}^{ q l} 

3. {gO,gl} {ql,q2,q3} 

4. {gl} {} (dead) 

5. {ql}JU{ql,q3} 

6. {ql,q3} {q0,ql,q3} 

7. {ql,q3} {ql,q2,q3} 

8. {q0,ql,q3} {ql,q2,q3} 

9. {q0,ql,q3j {ql,q2,q3j 

10. {ql,q2,q3j {ql,q2,q3j 

11. {ql,q2,q3} {ql,q2,q3j 



Solution 3.15. 

We use the subset construction. 




Step by step: 

1. Initial state: {0} 





3. 


{0} {0} 


4. 


{0,1}^ 


{0,1,2} 


5. 


{0,1} 


{0} 


6. 


{0,1,2} {0,1, 2, 3} 


7. 


{0,1,2} -^{0} 


8. 


{0,1, 2, 3} 


{0,1, 2, 3} 


9. 


{0,1, 2, 3} 


— {0,3} 


10. 


{0,3} ^ 


{0,1,3} 


11. 


{0,3} 


{0,3} 


12. 


{0,1,3} {0,1, 2, 3} 


13. 


{0,1,3} {0,3} 



Solution 3.16. (039) 

We show how to solve the problem both by using set partitioning and by using the double comple- 
ment. 



Set partitioning: We use the minimisation algorithm from Sailing, page 60. 
Level 0 {A, B, C, D, E, 0} 



Level 1 


{A, B, C, D, E} 


m 


The state 0 is non-accepting and the states 
A, B, C, D, E are accepting. 


Level 2 


{A, C, D, E} 


{B} 


m 


The state B transfers to 0 by the string a, 
but the states A, C , D. E transfers to states 
in {C, D, E}. 


Level 3 


{C, D, E} {A} 


{B} 


m 


A transfers to 0 by the string aa, but 
C, D, E transfers to states in {D, E}. 


Level 4 


{C,D,E} {A} 


{B} 


{0} 


No further states can be distinguished. 
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Double reversing: We use the double reversing algorithm from Sailing, page 62. 



• First we reverse the automation and obtain an NFA M rev . 




• The next step is to convert M rev to a DFA M' using subset construction. 

1. First we create a new state from the set of states in the NFA which are reachable by 
consuming e from some initial state. Note that the new state is an accepting state, since 
the state A is accepting. 




2. Then, we create a new state representing the reachable states from {A, B, C, D, E} by 
consuming an a. The reachable states are {A, C, D, E}. 




3. Now we find the states reachable by consuming a b from a state in {A, B, C, D, E}. 




4. We carry on in the same manner, but from some state in {A, C, D, E}. 




5. Over again, but from a state in {C, D, E}. 
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6. Now, we have obtained an equivalent DFA M' since no more transitions need to be ex- 
amined. 

• Now we reverse M' to obtain an NFA ( M') rev . 
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Solution 3.17. 

We solve the problem by subset construction, Sailing page 60. 



(08O) 



Level 0 


{1,2, 3, 4, 


5, 6, 7, 8} 


Start 


Level 1 


{2, 4, 6, 8} 


{1,3, 


5,7} 


The states 2,4,6 and 8 are accepting and the 
states 1,3,5 and 7 are not. 


Level 2 


{2, 6, 4, 8} 


{1,3} 


{5,7} 


The states 1 and 3 transfers to rejecting 
states by the string b , but the states 5 and 
7 transfers into accepting states. 


Level 3 


{2,6} {4,8} 


{1,3} 


{5,7} 


The states 2 and 6 transfers into acceptingt 
states by the string ab, but the states 4 and 8 
transfers into rejecting states by the string. 


Level 4 


{2,6} {4,8} 


{1,3} 


{5,7} 


No further distinguishing can be done. 




Solution 3.18. (o 64 ) 

Interpret the automata as one NFA and convert this to a DFA using subset construction. 
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Step by step: 



1. Initial state: {0, 5} 

2. {0,5} {2,5,6} 

3. {0,5}^U{4} 

4. {2,5,6} {4,5,6} 

5. {2,5,6} {3,7} 

6. {4} ^{4} 

7. {4} ^{4} 

8. {4,5,6} {4,5,6} 



9. {4,5,6} {4,7} 

10. {3,7} {3,6} 

11. {3,7} {3,5} 

12. {4,7} {4,6} 

13. {4, 7} {4, 5} 

14. {3,6} {3,6} 

15. {3,6} {3,7} 

16. {3,5} {3,5,6} 



17. {3,5} {3,7} 

18. {4,6} {4,6} 

19. {4,6} {4,7} 

20. {4,5} {4,5,6} 

21. {4,5} — ^ {4,7} 

22. {3,5,6} {3,5,6} 

23. {3,5,6} {3,7} 




Minimisation: 



Level 0 
Level 1 
Level 2 
Level 3 
Level 4 



{A,B,C,D,E,F, 
{ A,B,C,D,I , J} 
{A,<?} {B,D,J,J} 

{^1} {67} 

{67} {B,D,I,J} 



G, H, I, J, K} 

{E, F, G, H, K} 
{E,G,H,K} {F} 
{E,G,H,K} {F} 
{E,G,H,K} {F} 



Accepting or not 

foo 

bar 

No further distinguishing possible. 



The new DFA: 




Solution test assignment 2. 

TBA. 



(o28) 
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^Solution test assignment 3. 



(oil) 




Step by step: 

1. Initial state: {1, 2} 

2. {1,2} -2-> {1,2,3} 

3. {1,2} {1,2} 

4. {1,2,3} {1,2, 3, 4} 

5. {1,2,3} -U {1,2} 

6 . { 1 , 2 , 3 , 4 } { 1 , 2 , 3 , 4 } 

7. {1,2, 3, 4} {1,2} 

Solution test assignment 4. (072) 

TBA. 
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4 Regular Grammar 



(o46) 



Exercise 4.1. 

Try to construct a regular grammar which generates the language {a n b n \ n € N}. Explain why you 
don’t succeed! 

Exercise 4.2. 

Create regular grammar for the following languages. Use S as the first non-terminal 
created the grammars, construct the corresponding NFA’s. 

(i) {w\we {0,1}* Aw containts the substring 00 or 1 1} 

(ii) l*u 10 * 1*0 

Exercise 4.3. (049) 

Create NFA’s for the following languages and construct the corresponding regular grammars using 
these. Use S as the first non-terminal. 

(i) (eUl)(lU0)*01* 

(ii) 0*1* U (00)* 

(iii) {w | w e {0, 1}* A w doesn’t end in 01} 

Exercise 4.4. (047) 

Create a regular grammar for the language of string over (a, b} which contains the substrings aa and 
bb. 

Exercise 4.5. (oso) 

Construct regular grammars for the following languages. 

(i) (aabb* aab* a)* U ( ba*b ) 

(ii) {a n b m | n + m = 2k + 1 A n, to, k e N} 

Exercise 4.6. (077) 

Construct a regular grammar which produces the language over the alphabet S = {a,b} consisting 
of all strings with at most 3 o’s. Show how the string babbaab is produced. 

Exercise 4.7. (o78) 

Construct a regular grammar producing the following language over the alphabet S = (a, b}: 

L = {w\ (|tu| Q — |tu|f, = 1 (mod 3)} 

Then show how the string abaaaaba is produced. 



(o48 ) 
. When you have 
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Solutions 



Solution 4.1. (o46 ) 

It is logical to start by S' — > a ... as production rule. From the beginning we have to be able to 
generate fe’s to obtain the same number of a’s and b's. However, no other terminals may appear in a 
regular production rule. The context-sensitive grammars — > aSb \ s, on the other hand, is generating 
the language. 



Solution 4.2. 



(o48) 
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Solution 4.4. 



5 - S 1 \S 2 

51 — > O1S1 | bSi | aA 

A — > aS 

B -> aB\bB\bC 
C bD 

D aD \ bD \ e 

5 2 aS 2 | bS 2 | bE 

E —> bF 

F -► aF | bF | aG 
G -* bH 
H -> aH | bH | e 

(o50 ) 



5 -t Si|S 2 
Si — > aafeAi | £ 

Ai -> bAi | Bi 

Si — > aaC\ 

Ci -» feCi | Si 

51 -* aSi 

5 2 -> 6A 2 

A 2 — > aA 2 | S 2 
S 2 -» b 

(ii) Construct strings with odd length consisting of some number of a’s followed by some number 
of b’s. 

aA 0( jd | bB ot id 
aA even | &S even | £ 

^Aodd | bB 0 dd 
6S even I £ 
bB 0 dd 

Solution 4.6. (ow) 



S h 
A odd 

Aeven 

S 0 dd — » 

Seven -► 



Solution 4.5. 

(i) 



bS | aS | £ 

6A | aS | £ 
bB | aC | £ 

6C | £ 

Production of the string babbaab: 

S => bS => 6aS => 6a6A => babbA => babbaB => babbaaC => babbaabC => babbaab 



A Hi 
S h 
(7 



41 




Solution 4.7. 



(o78 ) 



S -► aA | bB 
A -> aB\bS\e 
B -s aS | 

The string abaaaaba is produced as follows. 

S' => a.4 => abS => a&aA => abaaB => abaaaS => abaaaaA => 
abaaaabS => abaaaabaA => abaaaaba 
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5 Non-Regularity 



Exercise 5.1. (079) 

Show that if the languages L 1 and L 2 are regular, then the following language are regular. You can 
use any proof technique you like, and use any proof from the course book and/or the lecture material. 

(i) LiL 2 

(ii) L\ 

(iii) Li U L 2 

(iv) L\, dvs. £* — Li 

(v) L\ n i/2 

(vi) Li — L 2 

(vii) {«; lueL, if and only if w ^ L 2 } 

(viii) prefix(Li) (seeNotation) 

(ix) suffix(Li) (see Notation) 

(x) substring(Li) where substring(L) = {w \ 3 x, y such that xwy e L} 

Exercise 5.2. (068) 

Which of the following languages are regular? 



(i) {a n b m | 


n > m V n < m} 


(ii) {a n b m | 


n > m Vn < m} 


(iii) {a n b m \ 


n > m An < m} 


(iv) {a n b m | 


n>m An < m } 



Exercise 5.3. (034) 

Prove that if L is regular, then L rev is regular. 

Exercise 5.4. (070) 

What can be said about the language L = {w £ {a, b}* \ abw = wba}l is it the empty string? is it 
regular? 

Exercise 5.5. (031) 

Does it hold that a language cannot be regular if it is the union of languages which are non-regular? 
Does it hold for the intersection? For concatenation? If it is true, prove it. If not, give a counter 
example. 
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Exercise 5.6. 



(o82 ) 



Determine if the following languages are regular or non-regular. Prove that your answers are correct. 
You may use properties of regular languages, closure properties of regular languages, things you 
know about DFA’s, NFA’s or regular expressions and grammars. 

(i) K& 2 "} 

(ii) {rn | rue {0,1}* A Mo 7 ^ Mi} 

(iii) {w | w £ {a, b}* A M« = 2 Mb} 

(iv) {w rev | w £ L(a n b m )} 

(v) {xwx rev y | x,w,y£ {a,6}+} 

(vi) {a n b n+m c m | m < 2} 

Exercise 5.7. (032) 

Prove that the following languages are non-regular: 

(i) {0T"0 m+ " | to, n > 1} 

(ii) {ww w £ (0 U 1)+} 

(iii) {(01)"(10)” | n > 0} 

(iv) The set of strings over {0,1} with the equally many zeroes as ones. 

(v) {w e (0 U 1)* \w = w rev } 

(vi) {xx rev w | x, w e (0 U 1)+} 

(vii) {0 ra | n is prime} 

(viii) {0 ra | n is composite} 

(ix) {0 m l n | m ^ n} 

(x) {0 m l ra | gcd(m, n) = 1}, where the function gcd(x, y) is the greatest common divisor. 



Exercise 5.8. ( 081 ) 

Prove that the language L = {a n b n }, n 6 Pi is non-regular using the pumping lemma. 

Exercise 5.9. (062) 

Prove that the language L = {w e {0, 1}* | 3u : www = uu} is non-regular. 

Exercise 5.10. (067) 

Let L be a regular language. What can be said about the following language? 

Li = {w | ww rev ww rev £ L} 

Exercise 5.11. (073) 



a: is a root to w if 3n > 0 such that w = x n . We define the root to a language L as rot(L) = {x \ 
3n>0:x n £ L}. Prove that for each language LC E* it holds that e £ L <*=> £* = root(L). 
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(ol4 ) 



b®' Test Assignment 5. (Lesson ) 

Determine if the following languages are regular or not. 

(i) L a = {a l b>c k | i+j = k} 

(ii) L b = (aV? | 1 < i < j < k < min(81, j 2 )} 
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Solutions 



Solution 5.1. (079) 

(i) Regular languages are by definition closed under concatenation. 

(ii) Regular languages are by definition closed under the Kleene star operation. 

(iii) Regular languages are by definition closed under union. 

(iv) If L\ is regular, it is possible to construct a DFA for the language. Assume that M\ = (Q, E, <5, s, F) 

is such a DFA. Then it is possible to construct a new DFA M[ = (Q. 53 , 6, s. Q — F) which re- 
jects a string w if and only if M i would accept w. /. L{M[) = E* — L(Mi) = E* — L \ = L-\ . 
Since it exists a DFA accepting the language Li, then L\ is regular if Li is regular. Regular 
languages are closed under complement. □ 

(v) DeMorgans law implies that Li fi L 2 = (^ L\ U . Since regular languages are closed under 
complement and union, they are also closed under intersection. 

(vi) Note that L\ — = %% fl L 2 . Since regular languages are closed under intersection and 

complement, they are also close under set difference. 

(vii) Note that {w \ w £ Li om och endast om w £ L 2 } = {L\ — L 2 ) U (L 2 — L\). Since regular 
languages are closed under set difference and union, this set is regular. 

(viii) Assume that Mi is a NFA accepting L \ . Construct a new NFA M[ equivalent to Mi except that 
we add a new state qf, which is the only final state in M[. We also add e-transitions to qf from 
all states from with a final state is reachable in Mi. If wz £ L\, then w is consumed by the 
regular transitions in Mi, followed by one of the e-transitions to qf to simulate the consumption 
of z, without actually consuming something. Thus, L(M [ ) = prefix (L(Mi)) = prefix(Li), 
that is, if Li is regular, then pref ix(Li) is also regular. 

(ix) Assume that Mi is a NFA accepting L\. Construct a new NFA M[ equivalent to Mi except 
that we add a new initial state qo. In addition, we add e-transitions from go to all states where 
an initial state is reachable. If xw £ L t , then the consumption of x is simulated by a transition 
from the first set of e-transitions and w is consumed by the regular transitions in Mi . Thus, 

L ( M[ ) = suffix{L(Mi)) = suffix (Li), and so it holds that if L i is regular, then suffix (L-i) is 
also regular. 

(x) Assume that Mi is an NFA accepting L\. Construct a new NFA M[ equivalent to Mi except 
that we add a new initial state go and a new final state g/, which is the only final state in M[. 

In addition, we add e-transitions go to all states from which an initial state can be reached, and 
e-transitions g/ from all states from which a final state can be reached in Mi. If xwy £ Li, 
the consumption of x is simulated by a transition from the first set of e-transitions, w is then 
consumed by the regular transitions of M i and the consumption of y is simulated bya transition 
from the second set of e-transitions. Thus, L(M[) = delstrang(L(Mi )) = delstrang(Li), and 
hence it holds that if Li is regular, then delstrang(Li) is also regular. 

Solution 5.2. (068) 

• Not regular. It is the same language as {a n b m \ n ^ m} which is the complement to {a n b rn \ 
n = m} which is not regular (which you should have discovered earlier). Since regularity 
is closed under complementation, the considered language can not be regular (since it would 
imply that {a n b m \ n = m} is regular). 

• Regular. The language can be described by the regular expression a *b*. 

• Regular. This language contains no language, and is thus regular. 
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Solution 5.3. 



Since L is regular, there exists a DFA accepting this language. If we reverse all transitions in this 
automation, and turn the initial state into an accepting state and all accepting states to initial states, 
the result is that we run the original automation “backwards”, and the new automation will accept the 
language L rev . Since there exists an finite automation accepting L m , the language is regular. 

Solution 5.4. (070) 

It is not the empty set since, for instance, w = a and w = aba satisfies abw = wba. If we look a bit 
closer at this condition we notice that w has to start with ab since this is the prefix on the left hand 
side. In the same way, the strings has to end in ba since that is the postfix on the right hand side. 
However, we can make an exception for the string a which may act both as initial and final a and 
“borrows”, so to speak, its b’s from the prefix and postfix strings in the condition. Thus, the string a 
satisfies the condition aba = aba. We now try to find some larger examples. The strings are supposed 
to start with ab, so we had to add a b to our minimal string a. In addition, we had the requirement 
that the strings should end in ba, but we can let the final b be the same as the initial one to obtain 
the string aba. If we keep on adding characters to this string, the requirement that they should end in 

ba yields the following strings ababa, abababa, ababababa, As can be seen, there is a regularity 

in the strings which can be summarised as: the strings begins with an a followed by zero or more 
ba’ s. This can be formulated as the regular expression a(ba) *. Since there exists a regular expression 
describing the language, it is regular. 

Solution 5.5. (o3i) 

Languages formed by the union of two non-regular languages may be regular. For instance, this is the 
case with the language constructed by the union of the two non-regular languages {a n b m \ n = m} 
and {a n b m \ n ft m}, which is described by the regular expression a*b*. 

The intersection of two non-regular languages may also result in a regular language, for instance 
the non-regular languages a n b n and b n a n , which intersection only contains the empty string, which 
is regular. 

Concatenation of two non-regular languages will however always result in a non-regular language. 
If we assume the contrary, that is, that it is possible to create a regular language by concatenation of 
two non-regular languages, there would be a finite automation accepting this language. Concatenation 
of languages corresponds to concatenation of two automata accepting respective language. Thus, it 
would be possible to split this finite automation into two separate automata accepting each of the 
original languages, which is a contradiction since these languages are non-regular which means there 
cannot exist finite automata accepting them. 

Solution 5.6. (082) 

• Non-regular. Assume that L = {a n b 2n } is a regular language. Then there exists a DFA 
M accepting L. Create a new DFA M' equivalent to M except that all pairs of transitions 
<5(gl, b) = (ft and 8{(ft, b) = (ft are changed to a single transition S(ql, b) = eft (two transi- 
tions in M consuming two b’s are replaced by a transition consuming one b). M ' consumes half 
as many b’s as M, L(M') = {a n b n } which has to be regular, since there is a DFA accepting 
this language. This is a contradiction since we know that {a n b n } is non-regular. Thus, L is also 
non-regular. 

• Non-regular. Assume that L = {w \ w € {0,1}* A vj | o ft w | i } is regular. Since regular 
languages are closed under complementation and intersection, ->(L) nL(0 * 1 1 has to be regular. 
However, ftL) fi L(0*1*) = {0"l n }, which is not regular, and thus neither is L. 

• Non-regular. Assume that L = {vj \ w G {a, b}* A \ w\ a = ftuft,} is regular. We know from the 
closure properties of regular languages that L n L(a*b*) is regular. However, L n L(a*b*) = 
{a n b 2n }, which is not regular. Hence, L is cannot be regular. 
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• Regular. The regular expression b*a* describes the language. 

• Regular. Theregularexpressiona(aU6) + a(aU6) + U6(aU6) + 6(aU6) + describes this language. 
Note that w can have an arbitrary length, so it may consume the trail of x and the beginning 
of x rev . The only requirement is that the first character in x (which must contain at least one 
character) is present in the rest of the string. 

• Non-regular. This language can also be expressed as L = {a n b n } U {a n b n+1 c} U {a n b n+2 c 2 }. 
Assume that L is regular. Then, L n {a*b*} has to be regular. However, L n {a*b*} = {a n b n }, 
which is non-regular, thus L cannot be regular. 

Solution 5.7. (032) 

• {01"0" +m | m, n > 1} = {0}{l"0"}{0 m }. Since we know that {1"0"} and {0” 1 } are non- 
regular and that concatenation of non-regular languages are non-regular, the language cannot 
be regular. 

• The language is non-regular. Since each pair of the infinitely many strings over (0 U 1) + are 
distinguished by the language, that is, each pair x, y £ (0 U 1) + ) where x ^ y is distinguished 
when xx belongs to the language and yx doesn’t. 

• The fact that this language is non-regular might be simpler to understand if we let a = 01 and 
b = 10 resulting in the language {a n b n \ n > 0} which is known to be non-regular. 

• Call this language L and assume that it is regular. Then L n (0*1*) is also regular since regular 
languages are closed under intersection. However, this is a contradiction since L n L(0 *1*) = 
{ 0 n l"}, w }ji c h is non-regular. Thus, L cannot be regular. 

• This language distinguishes an infinite number of strings and it therefore non-regular. Every 
pair of the infinitely many strings x, y £ (0 U 1) + where x ^ y are distinguished when xx rev 
belongs to the language but not yx rev . 

• The language has to distinguish infinitely many strings. The proof is similar to the solution 
above. For each of the infinitely many strings x, y e (0 U 1) + where x ^ y, there exists a string 
z = x rev such that xzw belongs to the language, but not yzw. Hence infinitely many strings are 
distinguished, and the language is not regular. 

• Also here it is useful to use argumentation about distinguishing strings. However, we need to 
make slightly stricter requirements on the strings since it otherwise would not be obvious that 
the language distinguishes between them. Take two strings x = 0 n and y = 0 n+1 . The number 
of such strings are infinite. Since there exists an infinite number of prime numbers, there exists 
a string z = 0 m such that n + m is prime, but not (n + 1) + m (except when n = m = 1 
where both 2 and 3 are prime). Then xz belongs to the language, but not yz. The language then 
distinguishes an infinite number of strings, thus making it non-regular. 

• Composite numbers are numbers which are not prime. Thus, this language is the complemen- 
tary language to the language or prime strings in the previous exercise, which was non-regular. 
Considering this and the closure properties of complementation, we can derive that this lan- 
guage cannot be regular. 

• This language is the complement language to {0 ra l"} which is non-regular. By the closure 
properties for complementation we know that also this language is non-regular. 

Solution 5.8. (081) 

We start by taking a look at the pumping lemma. 

The pumping lemma states properties that must hold for all strings in a regular language (with 

an infinite number of strings). The lemma states that all strings in such a language, which has a 




length exceeding a certain number, contains a non-empty substring which may both be “pumped up” 
(repeated) an arbitrary number of times or “pumped out” (removed) such that the resulting string still 
is in the language. In addition it holds that such a substring exists in each substring of a string in the 
language, if the substring has a length equal to or exceeding the number stated above. 

In other words, it holds that all strings w £ L such that |wj > N can be re-written as w = xyz 
where \y\ / 0, and for all i £ N it holds that xy l z £ L. In addition, it holds that all substrings of w 
where | w\ > N has such a “pumping block” y. 

To prove that a language is non-regular using the pumping lemma, it is easiest to try to construct 
a counter example. Thus, we start by assuming that the language L = {a n b n } is regular, and try to 
achieve a contradiction. We will do this by observing an individual string in L and show that it is not 
possible to pump it using the pumping lemma. 

Thus, according to the pumping lemma it holds that each string in L having a length greater tha 
or equal to an unknown number N should have a “pumping block” y which can be pumped. Since 
we have no idea what N would be, we have to be careful and select a string which is guaranteed to 
have a length which is equal to or greater than N. 

The only way we can do this is to choose a string which is parameterised in N . For instance, we 
can consider the string w = a N b N , for which it naturally holds that vj > N. 

The next step is to find out exact where in the string w = xyz the pumping block y is. Here we 
could use the cact that a pumping block has to exist in all substrings of w where w > N. 

Consider the substring a N , for which it holds that a iY | > N. Thus, a N must contain a y such that 
w = xy z z, i £ N. So we have w = a N ~ k a k b N where x = a N ~ k , y = a k and z = b N . We attempt to 
pump out the pumping block y once, and obtain the string a = a N ~ k b N where k > 0. This string is 
supposed to belong to the language L according to the pumping lemma (under the assumption that L 
is regular). But since we the number of a’s and the number of b’s are not the same, the pumped string 
cannot be in the language! We have found a contradiction, and have to reject our hypothesis that L 
would be regular. Hence, L is non-regular. □ 

Solution 5.9. ( 062 ) 

We use the pumping lemma also in this proof. The string w = 10 - 110"1 belongs to the language, 
since www = uu with u = I Q ,v 1 1 0 ;Y 1 10 ;Y 1 . If the starting string of zeroes is pumped, which 
according to the pumping lemma should be possible since the length of this string is greater than or 
equal to N, the string is no longer in the language. That is, if tu = 10 2Jv 110 Ar l then www ^ uu. 
Thus, the language is non-regular. 

Solution 5.10. (o67) 

The strings in the language consist of a concatenated pair of the substring ww rev . Thus, the strings are 
the same backwards and forwards, i.e., they are palindromes. Palindrome languages are not regular, 
and as we saw earlier, it is not possible to obtain a regular language by concatenating two non-regular 
languages. 

Solution 5.11. ( 073 ) 

We have that root(L) = {x | 3n > 0 : x n £ L}. Assume that e £ L and let n = 0. This results in 
root(L) = {x | x° £ L} and since we no longer have any requirements on which strings x which 
should belong to root(L) we have that root(L) = £*. Thus, we have shown that s £ L => £* = 
root(L). 

Now assume that S* = root(L). We want to show that this implies e £ L. Since e £ E*, it has 
to hold that 3n > 0 : e" £ L. No matter how many times we repeat s, the result is again e, i.e., Vn > 
0 : e n = e. Thus, it must hold that e £ L. Consequently, we have shown that e £ L 4= E * = root(L) 
and this together with the paragraph above yields that e £ L E * = root(L). 
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^Solution test assignment 5. 



( 014 ) 



• L a is non-regular. Assume that L a is regular for contradiction. Then there should be a number 
N such that if w £ L a and vj > N, then w contains a non-empty substring y which can be 
pumped such that w stays in the language. Such a string exists, as a matter of fact, in every 
substring of w with a length of at least N characters. Consider the string a N b N c 2N which 
belongs to the language since N + N = 2N. Let the beginning sequence of a’s be the pumping 
string y, which satisfies the condition in the pumping lemma since \y\ > N. If we pump up 
y once, the string no longer belongs to the language, since 2N + N / 2N. Thus, we have 
achieved a contradiction, and L a cannot be regular. 

• L), is regular. Each string in Lb may consist of maximum 81 of each of the characters a, b and 
c. This makes the number of strings in Lb finite, thus regular. 
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